INFTY: An integrated OCR system for mathematical documents
Identifieur interne : 001790 ( Main/Exploration ); précédent : 001789; suivant : 001791INFTY: An integrated OCR system for mathematical documents
Auteurs : Masakazu Suzuki (mathématicien) [Japon] ; Fumikazu Tamari [Japon] ; Ryoji Fukuda [Japon] ; Seiichi Uchida [Japon] ; Toshihiro Kanahori [Japon]Source :
Descripteurs français
- Pascal (Inist)
- Wicri :
- topic : Mathématiques.
English descriptors
- KwdEn :
Abstract
An integrated OCR (Optical Character Recognition) system for mathematical documents, called INFTY, is presented. INFTY consists of four procedures, i.e., layout analysis, character recognition, structure analysis of mathematical expressions, and manual error correction. In those procedures, several novel techniques are utilized for better recognition performance. Experimental results on about 500 pages of mathematical documents showed high character recognition rates on both mathematical expressions and ordinary texts, and sufficient performance on the structure analysis of the mathematical expressions.
Affiliations:
Links toward previous steps (curation, corpus...)
- to stream PascalFrancis, to step Corpus: 000494
- to stream PascalFrancis, to step Curation: 000295
- to stream PascalFrancis, to step Checkpoint: 000554
- to stream Main, to step Merge: 001868
- to stream Main, to step Curation: 001790
Le document en format XML
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" level="a">INFTY: An integrated OCR system for mathematical documents</title>
<author><name sortKey="Suzuki, Masakazu" sort="Suzuki, Masakazu" uniqKey="Suzuki M" first="Masakazu" last="Suzuki">Masakazu Suzuki (mathématicien)</name>
<affiliation wicri:level="4"><inist:fA14 i1="01"><s1>Faculty of Mathematics, Kyushu University</s1>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university" n="3">Université de Kyūshū</orgName>
</affiliation>
</author>
<author><name sortKey="Tamari, Fumikazu" sort="Tamari, Fumikazu" uniqKey="Tamari F" first="Fumikazu" last="Tamari">Fumikazu Tamari</name>
<affiliation wicri:level="1"><inist:fA14 i1="02"><s1>Department of Information Education, Fukuoka University of Education</s1>
<s3>JPN</s3>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<wicri:noRegion>Department of Information Education, Fukuoka University of Education</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Fukuda, Ryoji" sort="Fukuda, Ryoji" uniqKey="Fukuda R" first="Ryoji" last="Fukuda">Ryoji Fukuda</name>
<affiliation wicri:level="1"><inist:fA14 i1="03"><s1>Department of Human Welfare Engineering, Oita University</s1>
<s3>JPN</s3>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<wicri:noRegion>Department of Human Welfare Engineering, Oita University</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Uchida, Seiichi" sort="Uchida, Seiichi" uniqKey="Uchida S" first="Seiichi" last="Uchida">Seiichi Uchida</name>
<affiliation wicri:level="4"><inist:fA14 i1="04"><s1>Faculty of Information Science and Electrical Engineering, Kyushu University</s1>
<s3>JPN</s3>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
<author><name sortKey="Kanahori, Toshihiro" sort="Kanahori, Toshihiro" uniqKey="Kanahori T" first="Toshihiro" last="Kanahori">Toshihiro Kanahori</name>
<affiliation wicri:level="1"><inist:fA14 i1="05"><s1>Research Center on Educational Media, Tsukuba College of Technology</s1>
<s3>JPN</s3>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<wicri:noRegion>Research Center on Educational Media, Tsukuba College of Technology</wicri:noRegion>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">05-0039231</idno>
<date when="2003">2003</date>
<idno type="stanalyst">PASCAL 05-0039231 INIST</idno>
<idno type="RBID">Pascal:05-0039231</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000494</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000295</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000554</idno>
<idno type="wicri:Area/Main/Merge">001868</idno>
<idno type="wicri:Area/Main/Curation">001790</idno>
<idno type="wicri:Area/Main/Exploration">001790</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a">INFTY: An integrated OCR system for mathematical documents</title>
<author><name sortKey="Suzuki, Masakazu" sort="Suzuki, Masakazu" uniqKey="Suzuki M" first="Masakazu" last="Suzuki">Masakazu Suzuki (mathématicien)</name>
<affiliation wicri:level="4"><inist:fA14 i1="01"><s1>Faculty of Mathematics, Kyushu University</s1>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university" n="3">Université de Kyūshū</orgName>
</affiliation>
</author>
<author><name sortKey="Tamari, Fumikazu" sort="Tamari, Fumikazu" uniqKey="Tamari F" first="Fumikazu" last="Tamari">Fumikazu Tamari</name>
<affiliation wicri:level="1"><inist:fA14 i1="02"><s1>Department of Information Education, Fukuoka University of Education</s1>
<s3>JPN</s3>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<wicri:noRegion>Department of Information Education, Fukuoka University of Education</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Fukuda, Ryoji" sort="Fukuda, Ryoji" uniqKey="Fukuda R" first="Ryoji" last="Fukuda">Ryoji Fukuda</name>
<affiliation wicri:level="1"><inist:fA14 i1="03"><s1>Department of Human Welfare Engineering, Oita University</s1>
<s3>JPN</s3>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<wicri:noRegion>Department of Human Welfare Engineering, Oita University</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Uchida, Seiichi" sort="Uchida, Seiichi" uniqKey="Uchida S" first="Seiichi" last="Uchida">Seiichi Uchida</name>
<affiliation wicri:level="4"><inist:fA14 i1="04"><s1>Faculty of Information Science and Electrical Engineering, Kyushu University</s1>
<s3>JPN</s3>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName><settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
<author><name sortKey="Kanahori, Toshihiro" sort="Kanahori, Toshihiro" uniqKey="Kanahori T" first="Toshihiro" last="Kanahori">Toshihiro Kanahori</name>
<affiliation wicri:level="1"><inist:fA14 i1="05"><s1>Research Center on Educational Media, Tsukuba College of Technology</s1>
<s3>JPN</s3>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<wicri:noRegion>Research Center on Educational Media, Tsukuba College of Technology</wicri:noRegion>
</affiliation>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Algorithm</term>
<term>Integrated system</term>
<term>Mathematical formula</term>
<term>Mathematics</term>
<term>Optical character recognition</term>
<term>Performance evaluation</term>
<term>Structural analysis</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Reconnaissance optique caractère</term>
<term>Mathématiques</term>
<term>Formule mathématique</term>
<term>Système intégré</term>
<term>Analyse structurale</term>
<term>Algorithme</term>
<term>Evaluation performance</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr"><term>Mathématiques</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">An integrated OCR (Optical Character Recognition) system for mathematical documents, called INFTY, is presented. INFTY consists of four procedures, i.e., layout analysis, character recognition, structure analysis of mathematical expressions, and manual error correction. In those procedures, several novel techniques are utilized for better recognition performance. Experimental results on about 500 pages of mathematical documents showed high character recognition rates on both mathematical expressions and ordinary texts, and sufficient performance on the structure analysis of the mathematical expressions.</div>
</front>
</TEI>
<affiliations><list><country><li>Japon</li>
</country>
<region><li>Kyūshū</li>
<li>Préfecture de Fukuoka</li>
</region>
<settlement><li>Fukuoka</li>
</settlement>
<orgName><li>Université de Kyūshū</li>
</orgName>
</list>
<tree><country name="Japon"><region name="Kyūshū"><name sortKey="Suzuki, Masakazu" sort="Suzuki, Masakazu" uniqKey="Suzuki M" first="Masakazu" last="Suzuki">Masakazu Suzuki (mathématicien)</name>
</region>
<name sortKey="Fukuda, Ryoji" sort="Fukuda, Ryoji" uniqKey="Fukuda R" first="Ryoji" last="Fukuda">Ryoji Fukuda</name>
<name sortKey="Kanahori, Toshihiro" sort="Kanahori, Toshihiro" uniqKey="Kanahori T" first="Toshihiro" last="Kanahori">Toshihiro Kanahori</name>
<name sortKey="Tamari, Fumikazu" sort="Tamari, Fumikazu" uniqKey="Tamari F" first="Fumikazu" last="Tamari">Fumikazu Tamari</name>
<name sortKey="Uchida, Seiichi" sort="Uchida, Seiichi" uniqKey="Uchida S" first="Seiichi" last="Uchida">Seiichi Uchida</name>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001790 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001790 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Exploration |type= RBID |clé= Pascal:05-0039231 |texte= INFTY: An integrated OCR system for mathematical documents }}
This area was generated with Dilib version V0.6.32. |